decoder call
preliminary experiments with VIMCO which does not seem to outperform moving average baseline on the bit-vector
We thank all reviewers for their comments. In 5.1, we compare against Sum&Sample, which was shown in prior work [Figure 1 of Liu et al., 2019] Upon acceptance we will normalize these baselines for all experiments and include suggested ones. Thanks for bringing this up, scalability is an important point that we want to make sure is clear in the final version. We will make this clearer. We will include this analysis and plots as suggested.
Efficient Marginalization of Discrete and Structured Latent Variables via Sparsity
Correia, Gonçalo M., Niculae, Vlad, Aziz, Wilker, Martins, André F. T.
Training neural network models with discrete (categorical or structured) latent variables can be computationally challenging, due to the need for marginalization over large or combinatorial sets. To circumvent this issue, one typically resorts to sampling-based approximations of the true marginal, requiring noisy gradient estimators (e.g., score function estimator) or continuous relaxations with lower-variance reparameterized gradients (e.g., Gumbel-Softmax). In this paper, we propose a new training strategy which replaces these estimators by an exact yet efficient marginalization. To achieve this, we parameterize discrete distributions over latent assignments using differentiable sparse mappings: sparsemax and its structured counterparts. In effect, the support of these distributions is greatly reduced, which enables efficient marginalization. We report successful results in three tasks covering a range of latent variable modeling applications: a semisupervised deep generative model, a latent communication game, and a generative model with a bit-vector latent representation. In all cases, we obtain good performance while still achieving the practicality of sampling-based approximations.
- Europe > Portugal > Lisbon > Lisbon (0.04)
- Asia > Middle East > Jordan (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- (3 more...)